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NON -PARAMETRIC  ESTIMATION  OF  LOCATION 
by 

M.  V.  Johns,  Jr. 

1.  Introduction. 

The  problem  of  "efficiency-robust"  estimation  of  location  for 
symmetric  continuous  distributions  ha3  been  treated  by  a  number  of 
authors  (as  noted  in  the  text  below)  employing  various  optimality 
criteria.  The  present  paper  develops  a  sequence  of  estimators, indexed 
by  an  integer  valued  parameter  k,  exhibiting  tie  following  rather 
strong  notion  of  asymptotic  "efficiency -robustness":  For  any  k  the 
corresponding  estimator  is  consistent  and  asymptotically  normally 
distributed  (as  the  sample  size  n  increases)  for  any  F  in  a  large 
subset  3  of  the  class  of  symmetric  continuous  distributions.  Further¬ 
more,  for  any  e  >  0  the  variance  of  the  limiting  normal  distribution 
exceec.i  the  Cramer-Rao  bound  for  F  by  no  more  than  e  uniformly  for 
all  F e3,  for  each  sufficiently  large  k.  Thus,  for  large  k  the 
corresponding  estimator  is  (nearly)  Best  Asymptotically  Normal  (BAN) 
for  al]  Fe  $  . 

This  concept  of  optimality  is  distinguished  from  some  others  des¬ 
cribed  below  in  that  the  class  3>  is  "non-parametric"  and  indeed  con¬ 
tains  the  simple  parametric  families  used  in  other  definitions. 

Of  greater  significance  is  the  fact  that  the  simplest  non-trivial 
estimator  in  the  proposed  sequence  (corresponding  to  k  =  2)  exhibits 
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quite  high  efficiencies  for  small  to  moderate  sample  sizes  (n  =  10,20,1*0) 
for  a  collection  of  diverse  distributions  consisting  of  the  normal,  the 
Cauchy,  the  logistic,  the  double  exponential,  and  the  10$  contaminated 
normal.  These  efficiencies  relative  to  the  best  linear  unbiased  estimates 
(BLUE)  based  on  order  statistics  were  obtained  by  Monte  Carlo  experiments. 
The  estimator  seems  slightly  less  efficient  for  short -tailed  distribu¬ 
tions  compared  to  some  competitors  designed  particularly  for  vise  with 
such  distributions,  but  it  does  considerably  better  for  the  long-tailed 
Cauchy  case.  The  asymptotic  efficiency  of  the  estimator  may  be  computed 
by  hand  without  difficulty. 

This  estimator  is  no  more  difficult  to  compute  than,  say,  a  one 
sample  t -statistic,  and  the  calculation  yields  as  a  by-product  a  good 
estimate  of  the  variance  of  the  location  estimator  so  that  approximate 
confidence  intervals  and  tests  may  be  constructed.  The  author  believes 
this  computational  simplicity  to  be  one  of  the  most  important  features 
of  the  proposed  estimator  since  it  enhances  the  possibility  that  it 
may  be  found  useful  by  statistical  practitioners  dealing  with  actual 
data.  Previously  proposed  procedures  exhibiting  comparable  efficiency- 
robustness  require  complex  arithmetic  manipulations  and  some  involve 
table  lookups  or  matrix  inversions.  Considerations  of  simplicity  have 
previously  motivated  interest  in  very  easily  computable  estimators  such 
as  trimmed  or  Windsorized  means,  symmetric  linear  combinations  of  three 
order  statistics,  etc.  (see,  e.g.  Gastwirth  and  Cohen  [7)).  Such 
estimators  do  not  exhibit  as  broad  a  spectrum  of  efficiency-robustness 
as  those  considered  in  this  paper,  nor  are  they  associated  with  natural 


non-parametric  estimates  of  their  variances. 

The  various  approaches  to  efficiency-robust  estimation  may  for 
convenience  be  classified  into  two  categories:  (l)  those  achieving  asymptotic 
optimality  for  each  member  of  a  specified  finite  or  parametric  family 
of  distribution  "shapes”  (with  unspecified  scale  parameters),  and 
(2)  those  achieving  asymptotic  optimality  for  all  members  of  a  non- 
parametric  family  of  distributions  limited  only  by  regularity  conditions. 
Examples  of  the  first  category  of  estimation  procedures  are  given  by 
A.  Bimbaum  and  his  associates  in  [2],  [3],  [4],  J.  Gastvirth  and 
H.  Rubin  [6],  R.  V.  Hogg  [8],  and  V.  Mik^  [10].  In  these  papers  +'.e 
estimators  depend  explicitly  on  the  parametric  collection  of  distribu¬ 
tion  shapes  chosen  and  "optimality"  means  the  asymptotic  achievement 
of  the  Cram^r-Rao  lower  bound  for  the  variance  of  the  estimator,  or  the 
maximization  of  the  minimum  variance  over  the  specified  family  of  dis¬ 
tribution  shapes, or  admissibility  with  respect  to  the  family.  In  all 
of  these  cases  it  is  reasonable  to  expect  ( and  has  to  some  extent  been 
demonstrated)  that  the  proposed  estimators  will  be  robust  in  the  sense 
of  performing  well  for  distributions  which  do  not  differ  too  much  from 
the  families  in  terms  of  which  they  are  defined.  A  related  approach 
uses  the  data  to  select  one  estimator  from  a  small  specified  collection 
of  candidates.  Such  procedures  have  been  proposed  by  P.  Switzer  [12] 
and  L.  Jaeckel  [9]. 

The  approaches  failing  into  the  second  category  mentioned  above 
are  inherently  more  ambitious  in  their  goals  since  they  aim  at  asymp¬ 
totic  optimality  for  all  distributions  in  non -parametric  families 
constrained  only  by  regularity  conditions.  Estimators  are  regarded 
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as  optimal  if  they  achieve  the  Cramdr-Rao  bound  asymptotically  or  if 
they  are  BAN  (or  nearly  BAN).  The  possibility  of  achieving  this  kind 
of  uniform  efficiency  was  discussed  as  early  as  1956  by  C.  Stein  [11]. 
Recent  proposals  for  such  uniformly  efficient  estimators  have  been 
made  by  P.  K.  Bhattacharya  [ l] ,  C.  Van  Eeden  [l4]  and  K.  Takeuchi  [15] . 
The  first  two  of  these  papers  suggest  estimators  involving  pointwise 
estimates  of  density  functions  and  are  likely  to  require  rather  large 
samples  to  be  effective.  Takeuchi' s  ingenious  sequence  of  estimators 
(indexed  by  an  integer  k)  is  not  only  asymptotically  optimal  for 
large  values  of  k,  but  shows  good  small  sample  efficiency-robustness 
(for  small  values  of  k) .  A  natural  non-parametric  estimate  of  the 
variance  of  the  estimator  is  also  provided. 

The  estimators  proposed  in  the  present  oaper  fall  into  the  second 
category  described  above  and  (like  those  o.  Hogg  [8]  and  Takeuchi)  they 
are  "quasi -linear"  in  that  they  are  based  on  linear  combinations  of  the 
order  statistics  with  the  optimal  coefficients  being  estimated  from 
the  sample.  The  basic  statistic  from  which  the  proposed  estimators  are 
developed  consists  of  a  linear  combination  of  the  order  statistics  with 
the  coefficients  constrained  to  be  equal  within  each  of  several  sets 
corresponding  to  blocks  of  successive  order  statistics.  Thus,  the 
number  of  distinct  values  assumed  by  the  coefficients  is  some  number  k 
which  is  less  than  the  sample  size  n,  and  one  can  hope  to  use  the 
full  sample  to  approximate  the  k  optimal  values  for  the  coefficients. 
This  type  of  statistic  does  not  seem  to  have  been  considered  before  in 
the  literature  although  it  is  similar  in  spirit  to  linear  combinations 
of  selected  sample  quantiles,  and  such  statistics  have  been  discussed 


*t  length  In  many  papers.  The  proponed  estimation  lekm  eat  its  anf- 
tetle  properties  ere  Mailed  la  Beotian  2 .  In  Section  3  the  special 
case  k  -  2  is  discussed  sad  sew  Hants  carle  results  are  given,  me 
proof  of  the  theorem  stated  in  .Section  2  is  outlined  in  the  Appendix. 


2.  The  Proposed  getlmator. 

let  X^Xgr •  -,Xn  represent  independent  identically  distributed 

Observations  with  cosmon  distribution  function  F^x)  *  fix- 9)  share 

P  is  a  member  of  a  family  of  of  sptmetric  continuous  distributlaas 

aadlsfpiag  certain  regularity  cenditiens  described  belev.  Thus,  4 

represents  the  Median  of  F@  .  Let  <  \  <  •••  <  Yq  represent  the 

erdexed  X^'s  (i.e.,  the  "order  etatietice") .  Stqjpose  that  n  IS 

even  and  let  VV^  *  a  partition  of  the  Integers  ±,2,-",v/2, 

share  Tin  consists  of  *10  successive  integers,  so  that  I  *!-  -  «/2  • 

14) 

We  first  consider  estimators  of  9  of  the  farm 


(1) 


V 


j  'I 

,n-'i^L  *n 


.  sbere 


ln 


and  the  0^ ' s  are  constants  satisfying 

<*> 


l-i 


inin  *  f  * 


Condition  (2)  insures  that  ^  is  i  location  invariant  estimator.  Rote 
that  the  t  smallest  and  largest  order  statistics  have  been  "trimaed" 


5 


K 


from  the  estimator  SL  .  This  permits  the  development  of  substantially 

sn 

simpler  formulae  than  would  otherwise  be  possible.  Suppose  further  that 

c  -»  c, ,  i  =  1,2,  •  •  *,k,  and  t.  /n  -»  p. ,  i  =  0,1,2,  *  •  *,k,  as  n  -* «, 
in  i  1 

k  i 

Where  for  each  i,  p,  >  0  and  [  p.  =  r  .  Then  we  must  also  have 

1  i=0 


from  (2) 


(3) 


i=l 


c.p. 

ii 


where  c  =  ( c^Cg,  •  •  -,ck)  and  p  =  (P1,P2,  "  ‘,Pk)  •  Uhder  these  assump¬ 
tions  and  the  regularity  conditions  on  F  given  below,  for  fixed  k 
the  sequence  >/n  (0-0)  is  asymptotically  normally  distributed  with 


mean  zero  and  variance  cr^  given  by 


(U) 


k  k 

t7  =  2  Y  [  c,c,a,.  =  2cAc '  , 

k  jto  1  J 


where  A  =  ((a^))^^  811(1  the  aij's  are  as  follows:  Let 

"til 

t±  be  the  (p0+p1+*‘*+pi)  quantile  of  F,  i.e.,  F(  6k)  =  P0+Pl+" -Pi^ 
for  i  =1,2,  *  **,k-l,  and  =  0  .  Then  for  i  =  1,2,  •••,k 


‘ii 


rh  rZ± 

=  2£.  /  F(x)dx  -  2  /  xF(x)dx,  and 

Ji-1  K-i 


(5) 


aij  =  aji  =  (lj'^-l) 


fh > 


F(x)dx,  i  <  j  . 
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The  asymptotic  normality  of  0^  and  the  expression  for  <r£  follow 
directly  from  Theorem  3,  page  63,  of  Chemoff  et  al.  [5]. 

At  this  point  it  is  clear  that  since  the  a.  's  sure  invariant 
tinder  shifts,  one  could  estimate  them  by  replacing  the  ^'s  by  their 
sample  analogs  and  F(x)  by  the  sample  c.d.f.  Fn(x)  .  One  could 
then  choose  the  c. 's  so  as  to  minimize  ( U)  using  the  estimated  a,  ,'s 

^  *  J 

and  use  the  restilting  c^'s  in  .  Such  estimators  would  doubtless 
be  (nearly)  SAN  for  all  Fee*  for  sufficiently  large  k  and  small 
(the  trimming  proportion).  The  computation  of  such  estimates  would, 
however,  be  quite  complicated  and  the  minimization  process  would  require 
the  inversion  of  a  kxk  matrix.  We  proceed,  therefore,  to  introduce 
some  simplifying  approximations. 

For  i  =  1,2, * • • ,k  let 


(6) 


i-1 


di  =  *i  *i-l’ 


+  h 


and  b,  =  £  P,  +  =-Pi 
3=0  J 


Let 


(7) 


and 
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(8) 


R 


((ai/did;J^kxk 


-  B 
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and 


where  the  a^‘s  we  given  by  (5)* 

Then  if  c  =  =  (c^, c^,  •  •  *,0^), 

P  =  (Pi/di>P2/<V"’Pk/dk)»  **  have  from  (7)  and  (8) 

(9)  a|  =  c(BfR)c'  , 
and  condition  (5)  becomes 

(10)  ?P*  =|  . 


*  * 


«• 


It  is  shown  in  the  Appendix  that  for  the  cases  of  interest  the  contri- 

M  2 

button  of  cRc'  to  is  negligible  for  large  k  .  Hence  we  may 

O  M  M 

minimize  cr^  approximately  by  minimizing  cBc '  subject  to  condition  (10). 

A  matrix  of  the  form  (7)  is  non-singular  if  the  b^'s  are  distinct, 

and  a  straightforward  Lagrange  multiplier  argument  establishes  that  the 

vector  c  minimizing  cBc '  subject  to  (10)  is  given  by  c  =  i(  pB-1? ’ ) _1B-1p ' , 

2  ~2 

and  the  corresponding  approximate  minimum  of  cr.  ,  denoted  by  d  ,  is 
given  by  =  2cBc'  =  ifpB_1p')~1  .  Now  letting  e  =  ( e  ,e,,,  •  ••  ,e^) 
where  e^  =  d^1  X  ( ith  component  of  B  ^p ' ),  we  may  write 

(n)  =  £  (ep')ml, 

and  recalling  that  c^  =  c^/d^,  the  (approximately)  minimizing  vector 
c  is 

(12)  c  «  afe  . 


The  matrix  B  given  by  (7)  is  a  Green's  matrix  and.  hence  readilj  invert¬ 
ible  yielding  the  Jacobi  matrix  B-1  =  ((b^))..  vith 

JKX& 


bi,i+l  =  bi+l,i 


-2 


Pi+Pi+1 


>  i  -  l>2,°**,k-l  , 


(13) 


11  4V4pl+2p2 

b  =  (2p0+p1)(p1+p2)  9 


,  ii  2pi-l+^pi+2pi+l  ,  .  , 

b  ■  cp-^h vPitj  •  1=2’5’-'k-1 


kk 


pk-l+pk  ' 


and  all  other  bii  =  0  .  Thus  the  components  of  the  vector  e  are 


(H) 

1 


f2p1(2p0+2p1+p2)1 

1  1 

/2P2  1 

1 

61  " 

l(2Vpi 

4 

(v>2j 

SS  ' 

1 

2pi-;L  1 

-k.  (  2pi*i  ' 

1 

l\(pi 

-l+Pi)(Pi+IWj 

di  ' 

lPi -l+pi 

dt-i 

di+l  ' 

1  =  2,5, • • *,k-l, 


e,_  = 


2P„ 


2pk-l 


Pk-1+Pk  | 


pk-i+pk  I  VA 


Note  that  for  the  special  case  p  =  p  =  • • •  =  p  ,  the  formula  for 
e.^  for  i  =2,5,***,k-l  simplifies  to 


(15) 


9 


The  only  quantities  appearing  in  (14)  which  depend  on  the  distribu¬ 
tion  F  are  the  d^s  which  are  differences  of  quantiles  which  may  be 
effectively  estimated  by  their  sample  analogs.  For  each  i  let  ^ 

A  A  ^ 

be  the  sample  quantile  corresponding  to  ^  and  let  ^  . 

(More  explicit  formulas  for  the  ct^’s  are  offered  in  Section  3)*  Let 
e  be  the  vector  whose  components  e^  i  -  l,2,***,k,  are  obtained  from 
(13)  by  replacing  the  di's  by  ^’s  .  Then  following  (11)  and  (12) 
let 


(16) 

(17) 


a2 

°kn 


=  |(ep') 


-1 


and 


A  a2  a 
c  =  CTtoe 


The  proposed  estimator  of  9  (by  analogy  with  (l))  is 


(18) 


I  I/An 


7 


where  is  the  i  component  of  £  . 

It  has  been  assumed  so  far  that  n  is  even.  The  case  of  n  odd 
may  be  treated  in  various  ways,  perhaps  the  simplest  of  which  is  to 
delete  the  sample  median  from  the  order  statistics  and  compute  c  on 
the  basis  of  the  even  sample  size  n-1  .  The  coefficient  c.  should 

K. 

then  be  modified  by  multiplying  by  t,.  _  ,/(t.  +1)  and  the  saaple 

Kp  n-j.  K,n-i 

median  should  be  added  to  S.  .  .  Expression  (18)  may  then  be  used 

K;  n*j. 

with  the  8,  's  replacing  the  S,  's  . 
i  y  n*j.  in 


10 


A 

To  facilitate  the  discussion  of  the  asymptotic  behavior  of  0^  we 
shall  assume  that  for  each  k, 


(19) 


%  "  P2  =  * 


•  =pk  =  k 


-1 


^2  '  p0^  =  %.  * 


i.e.,  each  Sin  for  i  >  1  is  a  sum  of  (approximately)  2nq^.  successive 
order  statistics. 

For  any  continuous,  twice  differentiable  symmetric  distribution 
F(x)  with  density  f(x),  the  Fisher  information,  for  a  location  parameter 
when  the  npQ  smallest  and  npQ  largest  order  statistics  have  been 
deleted  ( "trimned")  from  a  sang>le  of  size  n  is  given  by 


(20) 


I(F,p0)  =  2j^(f'(x))2(f(x))'1dx  +  2f2UQ)p“1, 


where,  as  before,  F(|0)  =  PQ  (see,  e.g.  [5)).  The  untrimmed  (full 
sample)  case  is  obtained  by  setting  pQ  =  0  (|  *  «),  and  eliminating 

the  second  term  on  the  right  of  (20).  The  Cram£r-Rao  lower  bound  for 
the  variance  of  any  estimator  of  location  based  on  a  trimmed  sample  of 
size  n  is  then  (nI(F,p^))  1  . 

We  now  define  more  precisely  the  family  of  distributions  for  which 
the  proposed  estimator  is  shown  to  be  asymptotically  (nearly)  efficient. 
Definition.  Let  &  be  a  family  of  symmetric  continuous  distribution 
functions  F(x)  with  corresponding  density  functions  f(x)  =  F'(x) 
such  that 


11 


( a ,)  f(x)  and  its  first  three  derivatives  exist  and 
are  continuous  vith  f(x)  >0  for  all  x,  and 
xf(x)  -»  0  as  x  -*  ±  oo, 

(b)  f(x)  and  its  first  three  derivatives  sore  uni¬ 
formly  bounded  for  all  Ye.& , 

(c)  l(F,0)  given  by  (20)  is  finite  and  uniformly 
bounded  away  from  zero  for  all  Fee?,  and 

(d)  for  each  Fe^  the  conditions  of  Theorem  3  of 

[5 ]  are  satisfied  for  estimators  of  the  form  ( l) . 

The  asymptotic  behavior  of  the  proposed  estimator  is  summarized 
in  the  following 

Theorem:  If  and  are  given  by  (l8)  and  (16)  respectively 

and  ( 19)  is  satisfied  with  the  trimsing  proportion  pQ  held  fixed, 

then  for  any  e  >  0  there  exists  a  k  such  that  for  each  k  >  k  , 

e  e* 

for  all  Fe^,  the  sequence  N/n(  §  -  9 )  is  asymptotically  normally 

Kii  n 

2 

distributed  with  mean  zero  and  variance  t^F),  as  n  -»  »,  where 

(21)  i(T)  £  iTr^T  *  £ 

and  furthermore,  with  probability  approaching  one  as  n  -> », 

<22)  i9L-4(f)i<£  • 

The  proof  of  the  theorem  is  deferred  to  the  Appendix. 
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If  the  trimming  proportion  pQ  is  allowed  to  decrease  we  have  the 
following  immediate 

Corollary:  Under  the  conditions  of  the  theorem,  for  any  e  >  0,  there 
exist  a  k^  and  p£  such  that  if  the  trimming  proportion  =  p^, 

then  for  each  k  >  k  ,  for  all  FeS"  the  conclusions  of  the  theorem 

£ 

hold  with  I(F,pQ)  replaced  by  l(F,0)  in  (21). 

Thus,  for  a  suitable  choice  of  k  and  p^  one  can  make  9 [  as 

0  kn 

near  to  being  BAN  as  desired,  uniformly  for  all  . 

The  actual  family  S'  for  which  the  results  of  the  theorem  and  its 
corollary  hold  is  larger  than  that  described  in  the  above  definition. 

It  is  shown  in  the  Appendix  that  these  results  hold  for  the  double  expo¬ 
nential  distribution  (which  is  not  in  <4*  as  defined),  and  by  implication, 
for  other  distributions  having  a  finite  number  of  simple  discontinuities 
in  f  and  f'  . 

One  can  conclude  from  the  theorem  that  for  any  particular  Y^.S’ , 
there  exist  sequences  knt  w  and  pQn*  0,  as  n  -+  »,  such  that  the 
corresponding  sequence  of  estimators  9  (using  trimming  proportions 

K.  •  n 

2  n  -1 
p  )  is  BAN  for  F  .  Also  o.  is  a  consistent  estimator  of  (l(F,0))_ 
G1  n,n 

In  order  to  assert  the  existence  of  a  single  pair  of  such  sequences 
yielding  a  sequence  of  estimators  which  is  BAN  for  all  YtS<  it  is  neces¬ 
sary  to  further  restrict  S'  so  as  to  insure  uniform  convergence  to 
normality  in  the  theorem  cited  from  ("p  ]  Questions  concerning  the  rate 
behavior  of  the  k^  and  pQn  sequences  necessary  for  this  uniform  BAN 
property  can  also  be  raised,  These  questions  seem  somewhat  uninteresting 
in  view  of  the  results  of  the  next  section  wherein  it  is  shown  that  very 
small  values  of  k  suffice  to  produce  good  asymptotic  efficiencies  as 


Li 


veil  as  good  small  sample  performance. 


3.  The  Cases  k  =  2  and  k  =  3.  Asymptotic  efficiency-robustness  is 
only  a  theoretical  curiosity  unless  it  is  accompanied  by  satisfactory 
performance  for  small  and  moderate  sample  sizes.  The  estimator  0^ 
given  by  (18)  reduces  to  the  familiar  trimmed  mean  for  the  case  k  =  1, 
so  ve  concentrate  on  the  next  simplest  cases  where  k  =  2  or  3>  with 
particular  emphasis  on  k  =  2. 

It  is  convenient  to  introduce  certain  notations!  simplifications 
appropriate  to  the  case  of. small  k  and  n  .  Referring  to  the  quantities 
involved  in  (l),  let  r  =  t^,  s  »  tlp  =  tgn  =  •  •  •  =  tk-1  ,  and 
t  a  tto  ~  ~  T  ~  ( k-l)s  .  Thus,  for  k  =  2, 


S0,n  ■  £<VW  ’ 


(23) 


r+s 

v  ■  £  / y  w 


n/2 


"2’n  “ 


We  now  present  explicit  formulas  for  the  estimates  3^  of 
d^=li»5il.  Any  suitably  defined  sample  analogs  of  and 

A 

will  suffice  to  define  d^  but  since  represents  the  quantile  which 

is  the  upper  endpoint  of  the  quantile  range  corresponding  to  the  sample 
quantiles  contributing  to  S^n  and  the  lower  endpoint  of  the  corres¬ 
ponding  range  for  S.  .  it  seems  reasonable  to  choose  for  £.  the 

j.+Xjn  1 

midpoint  between  the  largest  order  statistic  contributing  to  and 

lk 


1 


,1 


the  smallest  contributing  to  This  leads  to  estimates  d^ 

given  by 

a  i  . 

.  di  ”  £  Yr+is+Yr+is+l+Yn-r-(i-l)s+Yn-r-( i-l)s+l 

—  Y  / ,  .  \  -Y  t  ,  n  \  >  -Y  -Y  -  ),  i=l,2 ,  •  •  • , k~l, 

r+(i-l)s  r+(i-l)s+l  n-r-is  n-r-is+1 

dk  =  ir(  Y(n/2)+t+Y(n/2)+t+l"Y(n/2)-t“Y(n/2)-t+l]  ’ 

k  =  2  and  even  n,  noting  that  pQ  =  r/n,  =  s/n,  and 
we  have  from  (l4)  and  the  definition  of  e, 

*  =  (2s(2r+2s+t )  1  _1  _  1 2t_|  1 

el  |(2r+s/fs+t) }  ^2  “  js+t|  *  £  * 

/S  _  |2t_\  _1  j  2s  \  1 

'2  "  H  %  ■  M  ' 

and  from  ( l6 ) ,  ( 17 ) ,  and  ( l8 ) , 


(24) 


Thus,  for 


P2  t/n, 


(25) 


(26) 


(27) 


a2 

a2,n 


A 

9. 


n 


2(  s^n^) 

e  S  +e  S„ 

1  l.n  2  2, n 


and 


2,n  2(se1+te2) 


Similar  simple  formulas  result  for  the  cane  k  =  3. 

For  given  values  of  pQ,  p  ,  and  p^  and  any  particular  distri- 

A 

bution  F,  the  asymptotic  variance  of  90  may  be  calculated  by  first 

d,  n 

calculating  the  four  a1j's  given  by  (5)  and  c  -  (c1^c2)  given  by  (12) 
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and  then  applying  ( 4) .  Asymptotic  variances  for  five  cases  were  computed 
and  the  efficiencies  relative  to  (l(F,0))_1  are  shown  below. 


Asymptotic  Efficiencies  for 


Table  1 

4,n  Vfa£n  PQ  =  Pi  =  P2 


.225 


Distribution 

Normal 

C.N.(  .10) 

Cauchy 

Dble.  Exp. 

Logistic 

|Efficiency  ($) 

94.4 

93.6 

97-2 

84.5 

9Y.5 

A  Monte  Carlo  experiment  was  performed  for  the  caseB  k  -  2  and  3, 

using  4000  replications,  for  the  normal,  contaminated  normal  (C.N.(.IO), 

Cauchy,  double  exponential,  and  logistic  distributions.  (The  C.N.( .10) 

distribution  is  a  standard  normal  with  a  10$  contamination  by  a  normal 

with  mean  zero  and  variance  =  9.)  Sample  sizes  n  -  10,20,40,  and  80 

were  used,  and  the  trimming  proportion  p^  (from  each  tail)  was  taken 

to  be  .05  for  all  cases  except  n  =  10  for  which  pQ  =  .10  .  For  the 

case  k  =  2,  for  n  >  10,  the  quantities  s  and  t  were  chosen  in 

three  ways:  ■ i)  2s  =  t,  !ii)  s  =  2t,  and  (iii  s  «t  ( sub ject  to  the 

limitation  that  s  and  t  must  be  positive  integers  with  s  +  t  =  ^  -  r) . 

For  n  =  10,  s  and  t  were  set  equal  to  two.  The  case  k  =  3  was 

computed  for  n  =  20,40,  and  80  with  pQ  =  .05  and  s  =  t  . 

Table  2  shows  the  mean  squared  errors  (MSE)  and  relative  efficiencies 

of  sL  ,  and  the  means  of  ,  for  the  case  k  *  2,  s  *  t  .  Ideally 
2,n  d,  n 

the  small  sample  efficiencies  should  be  computed  relative  to  the  variances 
of  the  minimus  variance  location  invariant  ( Pitman)  estimators,  but  these 
were  only  available  for  the  double  exponential  distribution  (from 
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Table  2 


Efficiencies  for  Caae  k  =  2,  s  «=  t 


Sample  Size 

Distribution 

Var(BLUE) 

MSE(6p  ) 

Ef  f .  (  ©  ' 

2:»n' 

n  =  10 

Normal 

.1000 

.n84 

.0892 

84* 

C.N.( .10) 

.1358 

.11*88 

.1245 

91 

Cauchy 

.3261 

.1*229 

.4841 

77 

Dble.  Exp. 

•  1399 

.1645 

.1399 

85 

Logistic 

•3C73 

.3446 

.2750 

89 

n  =  20 

Normal 

.0500 

.0579 

•0525 

86 

C.N.( .10) 

.0657 

.0710 

.0712 

93 

Cauchy 

.1256 

.1304 

.1429 

96 

Dble.  Exp. 

.0637 

.0703 

.0707 

91 

Logistic 

.1520 

.1668 

.1573 

91 

n  =  40 

Normal 

.0250 

.0284 

.0281 

88 

C.N. ( .10) 

.0315* 

.0354 

.0364 

89 

Cauchy 

.0500* 

.0589 

.0646 

85 

Dble.  Exp. 

.0297** 

.0333 

•0353 

89 

Logistic 

.0750* 

.0817 

.0822 

92 

n  =  80 

Normal 

.012  5 

.0137 

.0147 

91 

C.N. ( .10) 

.0157* 

.0170 

.0186 

92 

Cauchy 

.0250* 

.0269 

.0300 

93 

Dble.  Exp. 

.0142** 

.0155 

.0174 

92 

Logistic 

.0375 

.0393 

.0424 

95 

^  ^  K  ^ 

Cramer-Rao  bound  Pitman  estimator 
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Table  5 


Confidence  Interval  Error  Probabilities 

p<iSj  >»v>/s)  ,  (k=e,  s  *t) 


Distribution 

Sample  Size(n) 

7  =  1.645 

7  =  1.960 

7  =  2.57 

Normal 

10 

.208 

.143 

.074 

20 

.145 

.087 

-039 

40 

.119 

.066 

.020 

80 

.095 

.048 

.011 

C.N.( .10) 

10 

.196 

.136 

.068 

20 

-155 

.083 

.036 

40 

.117 

.065 

.021 

80 

.094 

.050 

.011 

Cauchy 

10 

.148 

.100 

.047 

20 

.124 

.076 

.030 

40 

.112 

.057 

.017 

80 

.095 

.048 

.011 

Dble.  Exp. 

10 

.176 

.120 

.058 

20 

.128 

.074 

.028 

40 

.ill 

.061 

.015 

8o 

.092 

.044 

.090 

Logistic 

10 

.200 

.138 

.070 

20 

.140 

.088 

.038 

40 

.117 

.065 

.019 

8o 

.094 

.048 

.011 

"t"-Dist . 

10 

.155 

.082 

.030 

PtlVll  >  7 ) 

20 

.117 

.065 

.019 

.108 

.057 

.014 

8o 

.104 

.054 

.012 

Normal 

pn^l>^ 

all  n 

.100 

.050 

.010 
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Table  4 


"Best  Case"  Efficiencies  (k  =  2) 

Sample  Size 

Distribution  n=20  n=40  n=60 


Best(s,t)  Eff.(  0 

T1)  Best(s.t)  Eff.(  9  ) 

n 

Best( s,t)  Eff .( 

Normal 

(3,6) 

90% 

(6,12) 

90% 

(12, 24) 

9ty 

C.N.( .10) 

(3,6) 

95 

(6,12) 

92 

(32,24) 

9  ^ 

Cauchy 

(6,3) 

100 

(32,6) 

87 

(24,32) 

96 

Dble.  Exp. 

(6,3) 

92 

(12,6) 

93 

(24,12) 

97 

Logistic 

(fc,5) 

92 

(9,9) 

92 

(12,24) 

96 

Table  5 

Ratio  of  Variance  for  k  =  3 

to  Variance 

for  k  =  2 

for  the  Case 

S  «  t 

Distribution 

n=20 

n=4o 

n=8o 

Normal 

1.03 

1.07 

1.05 

C.N.( .10) 

1.04 

1.05 

1.05 

Cauchy 

1.19 

1.11 

1.00 

Dble.  Exp. 

1.09 

1.08 

1.06 

Logistic 

1.03 

1.07 

1.06 
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V.  Hik£  ( 10 ] ) .  For  the  other  distributions  the  variances  of  the  BLUE's 

were  used  vhen  available,  and  for  the  larger  sample  sizes  the  Cramer -Rao 

bounds  were  used  which  accounts  for  the  apparent  slight  drop  in  several 

of  the  efficiencies  at  n=4o  compared  to  those  at  n=20.  The  fact 

that  the  efficiencies  shown  for  the  double  exponential  case  are  relatively 

high  compared  to  the  asymptotic  value  given  in  Table  1  is  a  consequence 

of  the  rather  slow  rate  at  which  the  variance  of  the  Pitman  estimator 

approaches  its  asymptotic  value  as  the  sample  size  increases . 

Table  3  shows  the  estimated  error  (non-coverage)  probabilities  for 

a  -1/2a 

confidence  intervals  of  the  form  9n  +  ra  an  •  The  values  of  y 

2,n  2,n  ' 

were  taken  (arbitrarily)  to  be  percentage  points  of  the  standard  normal 

distribution.  For  comparison  purposes  the  error  probabilities  are 

given  for  each  n  for  the  corresponding  confidence  intervals  based 

on  the  t -distribution  (assuming  normally  distributed  observations). 

The  distribution  of  the  "studentized"  '•  jrsion  of  shows  a  some- 

2,n 

what  greater  spread  than  the  corresponding  t-distribution  for  the 
smaller  sample  sizes.  For  each  pair  of  values  of  n  and  y  the  error 
probabilities  are  rather  similar  for  all  five  of  the  distributions 
considered,  and  they  do  not  differ  much  from  those  for  the  corresponding 
t-distribution  for  n=40  and  80. 

Table  4  shows  for  each  distribution  the  most  favorable  pair  of  values 
of  s  and  t  (among  the  three  possibilities  tested  for  each  n)  together 
with  the  corresponding  efficiencies  for  n  =  20,  40  and  8o.  For  all 


three  sample  sizes  the  large  values  of  t,  leading  to  large  groups  of 


central  order  statistics  receiving  weight  Cg,  tend  to  be  favorable  for 
the  relatively  short-tailed  distributions  (normal,  contaminated  normal, 
and  logistic),  while  the  reverse  is  true  for  the  long -tailed  cases 
( Cauchy  and  double  exponential) .  This  provides  considerable  justifica¬ 
tion  for  the  choice  s  =  t  as  a  suitable  compromise . 

A 


Table  5  shows  the  ratio  of  the  variance  of  0,  to  the  variance 
of  §2  ^  when  s  <*>  t,  for  n  *  2C,kO,  and  80,  for  each  of  the  five 
distributions.  The  uniform  superiority  of  the  estimator  using  k  =  2 
must  be  attributed  to  the  additional  sampling  fluctuation  introduced 
when  an  additional  coefficient  is  estimated  in  the  case  k  =  3>  together 
with  the  fact  that  the  efficiencies  are  already  high  for  the  case  k  =  2. 
In  a  preliminary  Monte  Carlo  investigation,  larger  values  of  k  were 
considered  and  in  all  cases  the  variances  of  the  estimators  for  a  given 
sample  size  tended  to  increase  with  increasing  k  . 

In  interpreting  the  Monte  Carlo  results  given  above  it  should  be 
borne  in  mind  that  the  standard  deviation  associated  with  each  of  the 
efficiencies  is  on  the  order  of  two  percentage  points.  It  is  perhaps 
worth  mentioning  that  the  ordinary  congruence -type  pseudo-random  numbers 
proved  to  be  completely  inadequate  for  this  investigation,  and  it  was 
found  necessary  to  introduce  a  suitable  "re-randomization"  procedure. 

A 


The  possibility  exists  that  the  performance  of  the  estimator  0g 
could  be  slightly  improved  by  a  minor  modification  as  indicated  in  the 
following 

Remark:  The  quantities  e^  and  3g  appearing  in  the  definition  (27) 


are  of  the  fora 


of 


A 


fi  -  “A2  -  • 

- 

where  o^,^,  i  =  1,2,  are  determined  by  (25).  Since  the  formulae  given 
in  (25)  were  obtained  by  an  asymptotic  argument  valid  for  large  k,  it 
is  remarkable  that  they  work  well  for  k  =  2 .  It  seems  entirely  possible 
that  a  somewhat  different  choice  for  the  o^’s  and  p^s  could  result 
in  uniformly  improved  performance.  The  author  conjectures  that  a  slight 
modification  of  (only)  could  lead  to  improved  efficiency  for  short¬ 
tailed  distributions  like  the  normal  without  materially  impairing  the  per¬ 
formance  of  the  estimator  for  long -tailed  distributions. 
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Appendix 


To  prove  the  theorem  of  Section  2  we  first  show  that  /n  (6^  -«3)  is 
asymptotically  normal  with  mean  zero  and  variance 


(28) 


Ck  -  I(F,p0)  +  6 


> 


for  arbitrary  fixed  e  >  0,  for  each  sufficiently  large  k,  for  all 
Fe  &  ,  when  9  is  given  by  (l)  with  the  s  given  by  (12).  The 
validity  of  (20)  follows  from  conditions  (a)  and  (c)  of  the  definition 
of  y  ,  and  asymptotic  normality  follows  from  condition  (d).  By 
virtue  of  expression  (9)  it  therefore  suffices  to  show  (i)  that  (28) 
holds  with  replaced  by  given  by  (ll),  and  (ii)  that 

IcRc'  |  <  e  for  all  sufficiently  large  k,  for  all  Fe  lr. 

To  verify  assertion  (i),  let  F  d(u)  =  G(dl),  and  referring  to  (19) 
let  r  =  p ■  +(i-l)q  ■  Then  by  definition  (6)  of  d  we  may  write 

1  U  K  i 


di  =  G^ri+qk^~G^ri^  >  i*l,2, 

di+1  =  G(ri+2<lk^"G(ri+(lk)  *  i=l,2,  ...,k-l, 

=  G(r^ )~G(r^-q^)/  i=2>3j...>k. 

Now  let  G1^  and  G”’  represent  the  first  three  derivatives  of  G 

evaluated  at  r^  Then  under  conditions  (a)  and  (b)  of  the  definition 
of  7f  ,  a  straightforward  Taylor' s  series  expansion  in  terms  of  qfc 
yields 


dIX  =  «k1(o;)'1-i<GP‘2G”i+V5(,5"1)2<0i>'5-5  o™( 0J)-2k4, 


dIn- <Gi)  -f‘Gi)  G'i+\i?<G’P2(Gi)'3-^Gr(GP'2w4' 

di'-i  ■  Gk<Gi>'1+i<Gi)'2G"1v^Gi)2(G;)'3-FGr‘Gi)'2H4> 

where  £  represents  a  generic  uniformly  bounded  function  of  i,  k, 
and  G.  Hence  by  ( 15),  for  1=2,5, •• .,k-l, 

e!  =  -  [2(G*’)2(G')'1'-G'"(G')’3]  +  Cqk  . 

Letting  f^,  f^,  and  f'j  be  f  and  its  corresponding  derivatives 
evaluated  at  G{t± )  we  have 


Gi  =  (fi)'^  Gl  “  G”1  =  3(f^)(fi)‘5-^(fi)"4  , 

so  that  et  =  -[f,i(f1)“1-(fi)2(f1)"2'J^qk. 

Also,  writing 

\  -  G(|)-G(|-<lk), 

Vi  -  g<?-V-g<2-  v 

and  referring  to  (l4),  we  obtain  by  a  similar  calculation 
<29)  ek-C-(i)to'(i;i-5  +  C,k=  -g02  ,Uk. 


Hence,  as  k  ' 


s  f, 

(30)  q  £  e1  -»  -  / 
i-2  1  Jp, 


m 


cgfcdll!}*., 

x  (g(u) )  J 


=  "  [  {  l°g  f(*)}  *(*)**  =  f  (f*  (x))2(f(x))_1dx  +  tl 

JQ(  p  J  l  dx^  J  Jfifn  )  3 


G(P0) 


and  the  convergence  is  uniform  for  all  Fe  Again  referring  to  (l1*). 


we  obtain 


ei  ■  A-  f,1K 


Thus,  from  (11),  (19),  (20),  (30)  and  (3l)  we  have 


2\  ei 
"k  1,1 

as  k  -»%  uniformly  for  all  Fe  ?  .  This  together  with  condition  (c) 
of  the  definition  of  >  implies  the  truth  of  assertion  (i). 

To  prove  assertion  (ii),  let  the  elements  of  the  matrix  R  given 
by  (8)  be  denoted  by  r^,  i,  j=l,2, . . ., k.  Then  by  (5)  and  (6)  we  have 
for  i  <  j 

,  ra(ri+«k) 


r  .,-.p 

J1  1  Mr.) 


f(x)dx  - bt  , 
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r  =  2d 
rii  : 


f  f°(v« *>  fa(ri+\>  i 

^  \G(ri+5lk)  I  F(x)dx  -  I  x  F(x)dxS  -bj  , 

[  ^G(ri)  -'G(ri)  j 


where  by  (6)  and  (19)>  bi  =  ri  +  2  qk‘ 

Expansion  in  Taylor’ s  series  with  respect  to  q^  yields  for  i  <  j, 
riJ  ri  +  2  qkGifi  *  ^ql "  bi  “  ^qk  ’ 


rll  '  ri  +  qk°’lf  1  '  V  5  qk  =  2  qk  +Uk 


-1  1  *2 

Since  qfc  =  k  (£-pQ)  ve  8ee  tliat  rij  =  °(k  )  uniformly  in 

l,j  and  Fe  31  for  i^j,  and  r^  =  0(k  1)  uniformly  in  i  and 
Fe  S’ .  Furthermore,  since  =  c^d^  where  the  c^s  are  given  by 
(12)  we  see  that  c  =  0(k  1)  uniformly  in  i  and  Fe  7F  ,  so  that 
cRc’  =0(k^),  uniformly  in  Fe  3~  which  proves  assertion  (ii). 

Conclusion  (21)  of  the  theorem  follows  from  (28)  provided  that  for 

—.A 

any  fixed  k,  /n  (0^-0^)  in  probability,  as  n  since  then 

2  2 

xk(F)  *  But  from  (18)  we  have 


v/n  (0,  -9  )  =  T  (c  -c  )(—  S.  )  . 

y  '  xn  kn'  i  i  /-  in' 

i=l  \fn 


Now  the  c^* s  given  by  (17)  are  consistent  estimates  of  the  corres¬ 
ponding  ci' s  given  by  (12)  since  the  d^  s  are  consistent  estimates 
of  the  corresponding  d^’s.  The  desired  result  follows  from  the 
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boundedness,  in  probability,  01  the  quantities  ~  S  ,  as  n 

YU  iB 

Conclusion  (22)  now  follows  from  the  fact  that  is  a  consistent  estimate 

of  which  by  assertion  (ii)  is  uniformly  close  to  for  sufficiently 

large  k.  This  completes  the  proof  of  the  theorem* 

We  consider  now  the  double  exponential  distribution  with  density 
1  -  I  x  I 

f (x)  =  —  e  .  The  expressions  for  e^  obtained  above  are  valid  for 
Is 1,2, . .  .,k-l.  Flrom  (31)  we  see  that  e^  =  £  for  this  case.  Also 
b2  k-1 

— =■  log  f(x)  =  0,  for  x  <  0,  so  that  q  £  e.  ->  0,  as  k  -*•». 
dx  *  i=l  1 

Expression  (29)  for  efc  is  not  valid  for  this  case  since  f' (x)  is 
not  continuous  at  x  =  0.  Direct  calculation  shows  that  e.  =  4q~1+0(l). 

K  K 

Therefore,  in  the  limit,  the  only  non-zero  c^  is  c^,  so  that,  as  k 
Increases,  9 ^  is  essentially  an  average  of  a  decreasing  proportion  of 
central  order  statistics.  The  conclusions  of  the  theorem  are  clearly 

A 

correct  for  the  corresponding  sequence  9^  even  though  the  double 
exponential  distribution  is  not  in  the  family  S'  as  defined. 
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